SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Thomas, Terry L.



(ii) TITLE OF INVENTION: PRODUCTION OF GAMMA LINOLENIC ACID BY A DELTA 6-DESATURASE

(iii) NUMBER OF SEQUENCES: 27 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Scully, Scott, Murphy & Presser 

(B) STREET: 400 Garden City Plaza

(C) CITY: Garden City 

(D) STATE: New York 

(E) COUNTRY: United States 

(F) ZIP: 11530 

(v) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: Floppy disk

(B) COMPUTER: IBM PC compatible

(C) OPERATING SYSTEM: PC-DOS/MS-DOS

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25

(vi) CURRENT APPLICATION DATA:

(A) APPLICATION NUMBER:

(B) FILING DATE:

(C) CLASSIFICATION:

(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: Presser, Leopold

(B) REGISTRATION NUMBER: 19,827 

(C) REFERENCE/DOCKET NUMBER: 8383ZYXWVU 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (516) 742-4343 

(B) TELEFAX: (516) 742-4366 

(C) TELEX: 230 901 SANS UR 




(2) INFORMATION FOR SEQ ID NO:1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3588 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 



(A) NAME/KEY: CDS

(B) LOCATION: 2002..3081

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:



GCTAGCCACC AGTGACGATG CCTTGAATTT GGCCATTCTG ACCCAGGCCC GTATTCTGAA 60

TCCCCGCATT CGCATTGTTA ATCGTTTGTT CAACCATGCC CTGGGTAAAC GTTTAGACAC 120

CASCTTGCCA GACCACGTTA GTTTGAGTGT TTCCGCCCTG GCGGCCCCGA TTTTTTCCTT 180

T(|§GGCTTTG GGCAATCAGG CGATCGGGCA ATTGCGTTTG TTTGACCAGA CTTGGCCCAT 240

tcIggaaatt GTCATTCACC AAGACCATCC CTGGCTCAAT TTACCCCTGG CGGATTTATG 300

GG^TGATCCG AGCCGAATGT TGATCTATTA CCTACCGGCC CACAGTGAAA CGGATTTAGT 360

AGfecGCAGTG GTGAATAATT TAACGTTGCA ATCTGGGGAC CATTTAATAG TGGGACAAAA 420

ACgjcCAACCC AAGACCAAAC GGCGATCGCC TTGGCGCAAA TTTTCCAAAC TGATTACCAA 480

CC^fGCGGGAG TATCAGCGGT ATGTCCAACA GGTGATATGG GTGGTGTTGT TTTTATTGTT 540

GA^FGATTTTT CTGGCCACCT TCATCTACGT TTCCATTGAT CAACATATTG CCCCAGTGGA 600

CGCGTTGTAT TTTTCCGTGG GCATGATTAC CGGGGCCGGT GGCAAGGAAG AGGTGGCCGA 660

AAAGTCCCCC GATATCATCA AAGTATTCAC AGTGGTGATG ATGATCGCCG GGGCGGGGGT 720

GATTGGTATT TGTTATGCCC TACTGAATGA TTTCATCCTT GGCAGTCGCT TTAGTCAGTT 780

TTTGGATGCG GCCAAGTTAC CCGATCGCCA TCACATCATC ATTTGTGGGC TGGGGGGAGT 840

GAGCATGGCC ATTATTGAAG AGTTAATTCA CCAGGGCCAT GAAATTGTGG TAATCGAAAA 900

GGATACAGAT AATCGTTTCT TGCATACGGC CCGCTCCCTG GGGGTGCCCG TAATTGTGGA 960

GGATGCCCGC CTAGAAAGAA CGTTGGCCTG CGCCAATATC AACCGAGCCG AAGCCATTGT 1020

GGTGGCCACC AGCGACGACA CCGTTAACTT GGAAATTGGC CTAACTGGCA AGGCGATCGC 1080

CCCTAGCCTG CCAGTGGTGT TGCGTTGCCA GGATGCCCAG TTTAGCCTGT CCCTGCAGGA 1140

AGTATTTGAA TTTGAAACGG TGCTTTGTCC GGCGGAATTG GCCACCTATT CCTTTGCGGC 1200

GGCGGCCCTG GGGGGCAAAA TTTTGGGCAA CGGCATGACC GATGATTTGC TGTGGGTAGC 1260

CCTAGCCACC TTAATCACTC CTAACCATCC CTTTGCCGAC CAATTGGTTA AAATTGCAGC 1320

CCAAAAGTCT GATTTCGTTC CCCTCTATCT AGAACGGGGT GGCAAAACCA TCCATAGCTG 1380

GGAATTATTG GGTACCCATC TCGACTCTGG AGACGTGTTG TATTTAACCA TGCCCGCCAC 1440

TGppCTAGAG CAACTTTGGC GATCGCCCCG TGCCACTGCT GATCCTCTGG ACTCTTTTTT 1500

ggWttagcat GGGGGGATGG AACTCTTGAC TCGGCCCAAT GGTGATCAAG AAAGAACGCT 1560

TT^CTATGT TTAGTATTTT TAAGTTAACC AACAGCAGAG GATAACTTCC AAAAGAAATT 1620

aaW:tcaaaa AGTAGCAAAA TAAGTTTAAT TCATAACTGA GTTTTACTGC TAAACAGCGG 1680

tggaaaaaag TCAGATAAAA TAAAAGCTTC ACTTCGGTTT TATATTGTGA CCATGGTTCC 1740

CAGGCATCTG CTCTAGGGAG TTTTTCCGCT GCCTTTAGAG AGTATTTTCT GCAAGTCGGC 1800

TAACTCCCCC ATTTTTAGGC AAAATCATAT ACAGACTATC CCAATATTGC CAGAGCTTTG 1860

ATGACTCACT GTAGAAGGCA GACTAAAATT CTAGCAATGG ACTCCCAGTT GGAATAAATT 1920

TTTAGTCTCC CCCGGCGCTG GAGTTTTTTT GTAGTTAATG GCGGTATAAT GTGAAAGTTT 1980

TTTATCTATT TAAATTTATA A ATG CTA ACA GCG GAA AGA ATT AAA TTT ACC 2031
Met Leu Thr Ala Glu Arg Ile Lys Phe Thr
1 5 10



CAG AAA CGG GGG TTT CGT CGG GTA CTA AAC CAA CGG GTG GAT GCC TAC 2079
Gln Lys Arg Gly Phe Arg Arg Val Leu Asn Gln Arg Val Asp Ala Tyr
15 20 25

TTT GCC GAG CAT GGC CTG ACC CAA AGG GAT AAT CCC TCC ATG TAT CTG 2127
Phe Ala Glu His Gly Leu Thr Gln Arg Asp Asn Pro Ser Met Tyr Leu
30 35 40

AAA ACC CTG ATT ATT GTG CTC TGG TTG TTT TCC GCT TGG GCC TTT GTG 2175
Lys Thr Leu Ile Ile Val Leu Trp Leu Phe Ser Ala Trp Ala Phe Val
45 50 55

CTT TTT GCT CCA GTT ATT TTT CCG GTG CGC CTA CTG GGT TGT ATG GTT 2223
Leu Phe Ala Pro Val Ile Phe Pro Val Arg Leu Leu Gly Cys Met Val
60 65 70

TTG GCG ATC GCC TTG GCG GCC TTT TCC TTC AAT GTC GGC CAC GAT GCC 2271
Leu Ala Ile Ala Leu Ala Ala Phe Ser Phe Asn Val Gly His Asp Ala
75 80 85 90

AAC CAC AAT GCC TAT TCC TCC AAT CCC CAC ATC AAC CGG GTT CTG GGC 2319
Asn His Asn Ala Tyr Ser Ser Asn Pro His Ile Asn Arg Val Leu Gly
95 100 105

ATG ACC TAC GAT TTT GTC GGG TTA TCT AGT TTT CTT TGG CGC TAT CGC 2367
Met Thr Tyr Asp Phe Val Gly Leu Ser Ser Phe Leu Trp Arg Tyr Arg
110 115 120

CAC AAC TAT TTG CAC CAC ACC TAC ACC AAT ATT CTT GGC CAT GAC GTG 2415
His Asn Tyr Leu His His Thr Tyr Thr Asn Ile Leu Gly His Asp Val
125 130 135

GAA ATC CAT GGA GAT GGC GCA GTA CGT ATG AGT CCT GAA CAA GAA CAT 2463
Glu Ile His Gly Asp Gly Ala Val Arg Met Ser Pro Glu Gln Glu His
140 145 150

GTT GGT ATT TAT CGT TTC CAG CAA TTT TAT ATT TGG GGT TTA TAT CTT 2511
Val Gly Ile Tyr Arg Phe Gln Gln Phe Tyr Ile Trp Gly Leu Tyr Leu
155 160 165 170

TTC ATT CCC TTT TAT TGG TTT CTC TAC GAT GTC TAC CTA GTG CTT AAT 2559
Phe Ile Pro Phe Tyr Trp Phe Leu Tyr Asp Val Tyr Leu Val Leu Asn
175 180 185

AAA GGC AAA TAT CAC GAC CAT AAA ATT CCT CCT TTC CAG CCC CTA GAA 2607
Lys Gly Lys Tyr His Asp His Lys Ile Pro Pro Phe Gln Pro Leu Glu
190 195 200

TTA GCT AGT TTG CTA GGG ATT AAG CTA TTA TGG CTC GGC TAC GTT TTC 2655
Leu Ala Ser Leu Leu Gly Ile Lys Leu Leu Trp Leu Gly Tyr Val Phe
205 210 215

GGC TTA CCT CTG GCT CTG GGC TTT TCC ATT CCT GAA GTA TTA ATT GGT 2703
Gly Leu Pro Leu Ala Leu Gly Phe Ser Ile Pro Glu Val Leu Ile Gly
220 225 230

GCT TCG GTA ACC TAT ATG ACC TAT GGC ATC GTG GTT TGC ACC ATC TTT 2751
Ala Ser Val Thr Tyr Met Thr Tyr Gly Ile Val Val Cys Thr Ile Phe
235 240 245 250

ATG CTG GCC CAT GTG TTG GAA TCA ACT GAA TTT CTC ACC CCC GAT GGT 2799
Met Leu Ala His Val Leu Glu Ser Thr Glu Phe Leu Thr Pro Asp Gly
255 260 265

GAA TCC GGT GCC ATT GAT GAC GAG TGG GCT ATT TGC CAA ATT CGT ACC 2847
Glu Ser Gly Ala Ile Asp Asp Glu Trp Ala Ile Cys Gln Ile Arg Thr
270 275 280

ACG GCC AAT TTT GCC ACC AAT AAT CCC TTT TGG AAC TGG TTT TGT GGC 2895
Thr Ala Asn Phe Ala Thr Asn Asn Pro Phe Trp Asn Trp Phe Cys Gly
285 290 295

GGT TTA AAT CAC CAA GTT ACC CAC CAT CTT TTC CCC AAT ATT TGT CAT 2943
Gly Leu Asn His Gln Val Thr His His Leu Phe Pro Asn Ile Cys His
300 305 310

ATT CAC TAT CCC CAA TTG GAA AAT ATT ATT AAG GAT GTT TGC CAA GAG 2991
Ile His Tyr Pro Gln Leu Glu Asn Ile Ile Lys Asp Val Cys Gln Glu
315 320 325 330

TTT GGT GTG GAA TAT AAA GTT TAT CCC ACC TTC AAA GCG GCG ATC GCC 3039
Phe Gly Val Glu Tyr Lys Val Tyr Pro Thr Phe Lys Ala Ala Ile Ala
335 340 345

TCT AAC TAT CGC TGG CTA GAG GCC ATG GGC AAA GCA TCG TGACATTGCC 3088
Ser Asn Tyr Arg Trp Leu Glu Ala Met Gly Lys Ala Ser
350 355 360



TTGGGATTGA AGCAAAATGG CAAAATCCCT CGTAAATCTA TGATCGAAGC CTTTCTGTTG 3148

CCCGCCGACC AAATCCCCGA TGCTGACCAA AGGTTGATGT TGGCATTGCT CCAAACCCAC 3208

TTTGAGGGGG TTCATTGGCC GCAGTTTCAA GCTGACCTAG GAGGCAAAGA TTGGGTGATT 3268

TTGCTCAAAT CCGCTGGGAT ATTGAAAGGC TTCACCACCT TTGGTTTCTA CCCTGCTCAA 3328

TGGGAAGGAC AAACCGTCAG AATTGTTTAT TCTGGTGACA CCATCACCGA CCCATCCATG 3388

TGGTCTAACC CAGCCCTGGC CAAGGCTTGG ACCAAGGCCA TGCAAATTCT CCACGAGGCT 3448

AGGCCAGAAA AATTATATTG GCTCCTGATT TCTTCCGGCT ATCGCACCTA CCGATTTTTG 3508

AGCATTTTTG CCAAGGAATT CTATCCCCAC TATCTCCATC CCACTCCCCC GCCTGTACAA 3568

AATTTTATCC ATCAGCTAGC 3588



(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS:



(A) LENGTH: 359 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 






(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Leu Thr Ala Glu Arg Ile Lys Phe Thr Gln Lys Arg Gly Phe Arg
1 5 10 15

Arg Val Leu Asn Gln Arg Val Asp Ala Tyr Phe Ala Glu His Gly Leu
20 25 30

Thr Gln Arg Asp Asn Pro Ser Met Tyr Leu Lys Thr Leu Ile Ile Val
35 40 45

Leu Trp Leu Phe Ser Ala Trp Ala Phe Val Leu Phe Ala Pro Val Ile
50 55 60

Phe Pro Val Arg Leu Leu Gly Cys Met Val Leu Ala Ile Ala Leu Ala
65 70 75 80

Ala Phe Ser Phe Asn Val Gly His Asp Ala Asn His Asn Ala Tyr Ser
85 90 95

Ser Asn Pro His Ile Asn Arg Val Leu Gly Met Thr Tyr Asp Phe Val
100 105 110

Gly Leu Ser Ser Phe Leu Trp Arg Tyr Arg His Asn Tyr Leu His His
115 120 125

Thr Tyr Thr Asn Ile Leu Gly His Asp Val Glu Ile His Gly Asp Gly
130 135 140

Ala Val Arg Met Ser Pro Glu Gln Glu His Val Gly Ile Tyr Arg Phe
145 150 155 160

Gln Gln Phe Tyr Ile Trp Gly Leu Tyr Leu Phe Ile Pro Phe Tyr Trp
165 170 175

Phe Leu Tyr Asp Val Tyr Leu Val Leu Asn Lys Gly Lys Tyr His Asp
180 185 190

His Lys Ile Pro Pro Phe Gln Pro Leu Glu Leu Ala Ser Leu Leu Gly
195 200 205

Ile Lys Leu Leu Trp Leu Gly Tyr Val Phe Gly Leu Pro Leu Ala Leu
210 215 220

Gly Phe Ser Ile Pro Glu Val Leu Ile Gly Ala Ser Val Thr Tyr Met
225 230 235 240

Thr Tyr Gly Ile Val Val Cys Thr Ile Phe Met Leu Ala His Val Leu
245 250 255

Glu Ser Thr Glu Phe Leu Thr Pro Asp Gly Glu Ser Gly Ala Ile Asp
260 265 270

Asp Glu Trp Ala Ile Cys Gln Ile Arg Thr Thr Ala Asn Phe Ala Thr
275 280 285

Asn Asn Pro Phe Trp Asn Trp Phe Cys Gly Gly Leu Asn His Gln Val
290 295 300

Thr His His Leu Phe Pro Asn Ile Cys His Ile His Tyr Pro Gln Leu
305 310 315 320

Glu Asn Ile Ile Lys Asp Val Cys Gln Glu Phe Gly Val Glu Tyr Lys
325 330 335

Val Tyr Pro Thr Phe Lys Ala Ala Ile Ala Ser Asn Tyr Arg Trp Leu
340 345 350

Glu Ala Met Gly Lys Ala Ser
355

(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1884 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: both

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:



AGCTTCACTT CGGTTTTATA TTGTGACCAT GGTTCCCAGG CATCTGCTCT AGGGAGTTTT 60

TCCGCTGCCT TTAGAGAGTA TTTTCTCCAA GTCGGCTAAC TCCCCCATTT TTAGGCAAAA 120

TCATATACAG ACTATCCCAA TATTGCCAGA GCTTTGATGA CTCACTGTAG AAGGCAGACT 180

AAAATTCTAG CAATGGACTC CCAGTTGGAA TAAATTTTTA GTCTCCCCCG GCGCTGGAGT 240

TTTTTTGTAG TTAATGGCGG TATAATGTGA AAGTTTTTTA TCTATTTAAA TTTATAAATG 300

CTAACAGCGG AAAGAATTAA ATTTACCCAG AAACGGGGGT TTCGTCGGGT ACTAAACCAA 360

CGGGTGGATG CCTACTTTGC CGAGCATGGC CTGACCCAAA GGGATAATCC CTCCATGTAT 420

CTGAAAACCC TGATTATTGT GCTCTGGTTG TTTTCCGCTT GGGCCTTTGT GCTTTTTGCT 480

CCAGTTATTT TTCCGGTGCG CCTACTGGGT TGTATGGTTT TGGCGATCGC CTTGGCGGCC 540

TTTTCCTTCA ATGTCGGCCA CGATGCCAAC CACAATGCCT ATTCCTCCAA TCCCCACATC 600

AACCGGGTTC TGGGCATGAC CTACGATTTT GTCGGGTTAT CTAGTTTTCT TTGGCGCTAT 660

CGCCACAACT ATTTGCACCA CACCTACACC AATATTCTTG GCCATGACGT GGAAATCCAT 720

GGAGATGGCG CAGTACGTAT GAGTCCTGAA CAAGAACATG TTGGTATTTA TCGTTTCCAG 780

CAATTTTATA TTTGGGGTTT ATATCTTTTC ATTCCCTTTT ATTGGTTTCT CTACGATGTC 840

TACCTAGTGC TTAATAAAGG CAAATATCAC GACCATAAAA TTCCTCCTTT CCAGCCCCTA 900

GAATTAGCTA GTTTGCTAGG GATTAAGCTA TTATGGCTCG GCTACGTTTT CGGCTTACCT 960

CTGGCTCTGG GCTTTTCCAT TCCTGAAGTA TTAATTGGTG CTTCGGTAAC CTATATGACC 1020

TATGGCATCG TGGTTTGCAC CATCTTTATG CTGGCCCATG TGTTGGAATC AACTGAATTT 1080

CTCACCCCCG ATGGTGAATC CGGTGCCATT GATGACGAGT GGGCTATTTG CCAAATTCGT 1140

ACCACGGCCA ATTTTGCCAC CAATAATCCC TTTTGGAACT GGTTTTGTGG CGGTTTAAAT 1200

CACCAAGTTA CCCACCATGT TTTCCCCAAT ATTTGTCATA TTCACTATCC CCAATTGGAA 1260

AATATTATTA AGGATGTTTG CCAAGAGTTT GGTGTGGAAT ATAAAGTTTA TCCCACCTTC 1320

AAAGCGGCGA TCGCCTCTAA CTATCGCTGG CTAGAGGCCA TGGGCAAAGC ATCGTGACAT 1380

TGCCTTGGGA TTGAAGCAAA ATGGCAAAAT CCCTCGTAAA TCTATGATCG AAGCCTTTCT 1440

GTTGCCCGCC GACCAAATCC CCGATGCTGA CCAAAGGTTG ATGTTGGCAT TGCTCCAAAC 1500

CCACTTTGAG GGGGTTCATT GGCCGCAGTT TCAAGCTGAC CTAGGAGGCA AAGATTGGGT 1560

GATTTTGCTC AAATCCGCTG GGATATTGAA AGGCTTCACC ACCTTTGGTT TCTACCCTGC 1620

TCAATGGGAA GGACAAACCG TCAGAATTGT TTATTCTGGT GACACCATCA CCGACCCATC 1680

CATGTGGTCT AACCCAGCCC TGGCCAAGGC TTGGACCAAG GCCATGCAAA TTCTCCACGA 1740

GGCTAGGCCA GAAAAATTAT ATTGGCTCCT GATTTCTTCC GGCTATCGCA CCTACCGATT 1800

TTTGAGCATT TTTGCCAAGG AATTCTATCC CCACTATCTC CATCCCACTC CCCCGCCTGT 1860

ACAAAATTTT ATCCATCAGC TAGC 1884



(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1685 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:
AATATCTGCC TACCCTCCCA AAGAGAGTAG TCATTTTTCA TCAATGGCTG CTCAAATCAA 60 




GAAATACATT ACCTCAGATG AACTCAAGAA CCACGATAAA CCCGGAGATC TATGGATCTC 120

GATTCAAGGG AAAGCCTAlfe ATGTTTCGGA TTGGGTGAAA GACCATCCAG GTGGCAGCTT 180

TCCCTTGAAG AGTCTTGCTG GTCAAGAGGT AACTGATGCA TTTGTTGCAT TCCATCCTGC 240

CTCTACATGG AAGAATCTTG ATAAGTTTTT CACTGGGTAT TATCTTAAAG ATTACTCTGT 300

TTGTGAGGTT TCTAAAGATT ATAGGAAGCT TGTGTTTGAG TTTTCTAAAA TGGGTTTGTA 360

TGACAAAAAA GGTCATATTA TGTTTGCAAC TTTGTGCTTT ATAGCAATGC TGTTTGCTAT 420

GAGTGTTTAT GGGGTTTTGT TTTGTGAGGG TGTTTTGGTA CATTTGTTTT CTGGGTGTTT 480

GATGGGGTTT CTTTGGATTC AGAGTGGTTG GATTGGACAT GATGCTGGGC ATTATATGGT 540

AGTGTCTGAT TCAAGGCTTA ATAAGTTTAT GGGTATTTTT GCTGCAAATT GTCTTTCAGG 600

AAfj&AGTATT GGTTGGTGGA AATGGAACCA TAATGCACAT CACATTGCCT GTAATAGCCT 660

TGAATATGAC CCTGATTTAC AATATATACC ATTCCTTGTT GTGTCTTCCA AGTTTTTTGG 720

TTgXCTCACC TCTCATTTCT ATGAGAAAAG GTTGACTTTT GACTCTTTAT CAAGATTCTT 780

TGll&AGTTAT CAACATTGGA CATTTTACCC TATTATGTGT GCTGCTAGGC TCAATATGTA 840

TGTACAATCT CTCATAATGT TGTTGACCAA [illegible] TCCTATCGAG CTCAGGAACT 900

CT1SGGATGC CTAGTGTTCT CGATTTGGTA CCCGTTGCTT GTTTCTTGTT TGCCTAATTG 960

ggsS:gaaaga ATTATGTTTG TTATTGCAAG TTTATCAGTG ACTGGAATGC AACAAGTTCA 1020

gt|J:tccttg AACCACTTCT CTTCAAGTGT TTATGTTGGA AAGCCTAAAG GGAATAATTG 1080

GTTTGAGAAA CAAACGGATG GGACACTTGA CATTTCTTGT CCTCGTTGGA TGGATTGGTT 1140

TCATGGTGGA TTGCAATTCC AAATTGAGCA TCATTTGTTT CCCAAGATGC CTAGATGCAA 1200

CCTTAGGAAA ATCTCGCCCT ACGTGATCGA GTTATGCAAG AAACATAATT TGCCTTACAA 1260

TTATGCATCT TTCTCCAAGG CCAATGAAAT GACACTCAGA ACATTGAGGA ACACAGCATT 1320

GCAGGCTAGG GATATAACCA AGCCGCTCCC GAAGAATTTG GTATGGGAAG CTCTTCACAC 1380

TCATGGTTAA AATTACCCTT AGTTCATGTA ATAATTTGAG ATTATGTATC TCCTATGTTT 1440

GTGTCTTGTC TTGGTTCTAC TTGTTGGAGT CATTGCAACT TGTCTTTTAT GGTTTATTAG 1500

ATGTTTTTTA ATATATTTTA GAGGTTTTGC TTTCATCTCC ATTATTGATG AATAAGGAGT 1560

TGCATATTGT CAATTGTTGT GCTCAATATC TGATATTTTG GAATGTACTT TGTACCACTG 1620

TGTTTTCAGT TGAAGCTCAT GTGTACTTCT ATAGACTTTG TTTAAATGGT TATGTCATGT 1680

TATTT 1685



(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 448 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Ala Ala Gln Ile Lys Lys Tyr Ile Thr Ser Asp Glu Leu Lys Asn
1 5 10 15

His Asp Lys Pro Gly Asp Leu Trp Ile Ser Ile Gln Gly Lys Ala Tyr
20 25 30

Asp Val Ser Asp Trp Val Lys Asp His Pro Gly Gly Ser Phe Pro Leu
35 40 45

Lys Ser Leu Ala Gly Gln Glu Val Thr Asp Ala Phe Val Ala Phe His
50 55 60

Pro Ala Ser Thr Trp Lys Asn Leu Asp Lys Phe Phe Thr Gly Tyr Tyr
65 70 75 80

Leu Lys Asp Tyr Ser Val Ser Glu Val Ser Lys Asp Tyr Arg Lys Leu
85 90 95

Val Phe Glu Phe Ser Lys Met Gly Leu Tyr Asp Lys Lys Gly His Ile
100 105 110

Met Phe Ala Thr Leu Cys Phe Ile Ala Met Leu Phe Ala Met Ser Val
115 120 125

Tyr Gly Val Leu Phe Cys Glu Gly Val Leu Val His Leu Phe Ser Gly
130 135 140

Cys Leu Met Gly Phe Leu Trp Ile Gln Ser Gly Trp Ile Gly His Asp
145 150 155 160

Ala Gly His Tyr Met Val Val Ser Asp Ser Arg Leu Asn Lys Phe Met
165 170 175

Gly Ile Phe Ala Ala Asn Cys Leu Ser Gly Ile Ser Ile Gly Trp Trp
180 185 190

Lys Trp Asn His Asn Ala His His Ile Ala Cys Asn Ser Leu Glu Tyr
195 200 205

Asp Pro Asp Leu Gln Tyr Ile Pro Phe Leu Val Val Ser Ser Lys Phe
210 215 220

Phe Gly Ser Leu Thr Ser His Phe Tyr Glu Lys Arg Leu Thr Phe Asp
225 230 235 240

Ser Leu Ser Arg Phe Phe Val Ser Tyr Gln His Trp Thr Phe Tyr Pro
245 250 255

Ile Met Cys Ala Ala Arg Leu Asn Met Tyr Val Gln Ser Leu Ile Met
260 265 270

Leu Leu Thr Lys Arg Asn Val Ser Tyr Arg Ala Gln Glu Leu Leu Gly
275 280 285

Cys Leu Val Phe Ser Ile Trp Tyr Pro Leu Leu Val Ser Cys Leu Pro
290 295 300

Asn Trp Gly Glu Arg Ile Met Phe Val Ile Ala Ser Leu Ser Val Thr
305 310 315 320

Gly Met Gln Gln Val Gln Phe Ser Leu Asn His Phe Ser Ser Ser Val
325 330 335

Tyr Val Gly Lys Pro Lys Gly Asn Asn Trp Phe Glu Lys Gln Thr Asp
340 345 350

Gly Thr Leu Asp Ile Ser Cys Pro Pro Trp Met Asp Trp Phe His Gly
355 360 365

Gly Ser Gln Phe Gln Ile Glu His His Leu Phe Pro Lys Met Pro Arg
370 375 380

Cys Asn Leu Arg Lys Ile Ser Pro Tyr Val Ile Glu Leu Cys Lys Lys
385 390 395 400

His Asn Leu Pro Tyr Asn Tyr Ala Ser Phe Ser Lys Ala Asn Glu Met
405 410 415

Thr Leu Arg Thr Leu Arg Asn Thr Ala Leu Gln Ala Arg Asp Ile Thr
420 425 430

Lys Pro Leu Pro Lys Asn Leu Val Trp Glu Ala Leu His Thr His Gly
435 440 445

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 




Trp Ile Gly His Asp Ala Gly His
1 5 



(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 

Asn Val Gly His Asp Ala Asn His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 8 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Val Leu Gly His Asp Cys Gly His
1 5

(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (peptide) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Val Ile Ala His Glu Cys Gly His
1 5



(2) INFORMATION FOR SEQ ID NO: 10: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10 

Val Ile Gly His Asp Cys Ala His
1 5 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11

Val Val Gly His Asp Cys Gly His
1 5

(2) INFORMATION FOR SEQ ID NO: 12:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12 

His Asn Ala His His 
1 5 

(2) INFORMATION FOR SEQ ID NO:13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (peptide) 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 

His Asn Tyr Leu His His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 

His Arg Thr His His 

1 5

(2) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15

His Arg Arg His His
1 5



(2) INFORMATION FOR SEQ ID NO: 16: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16 

His Asp Arg His His 
1 5



(2) INFORMATION FOR SEQ ID NO: 17: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 

His Asp Gln His His
1 5 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18

His Asp His His His
1 5

(2) INFORMATION FOR SEQ ID NO: 19:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19 

His Asn His His His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Peptide 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20 

Phe Gln Ile Glu His His
1 5 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

His Gln Val Thr His His
1 5 

(2) INFORMATION FOR SEQ ID NO:22:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22:

His Val Ile His His
1 5

(2) INFORMATION FOR SEQ ID NO: 23: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

His Val Ala His His 
1 5 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 



(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

His Ile Pro His His
1 5

(2) INFORMATION FOR SEQ ID NO: 25: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Peptide

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:

His Val Pro His His
1 5

(2) INFORMATION FOR SEQ ID NO: 26:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1702 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: both
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 48..1406

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

CCCCAAAAAT TTTCATTGTT CTCCATCTGG ACCACAGCAT CCACACA ATG GAG GGC 56
Met Glu Gly
1

GAA GCT AAG AAG TAT ATC ACG GCG GAG GAC CTC CGC CGC CAC AAC AAG 104
Glu Ala Lys Lys Tyr Ile Thr Ala Glu Asp Leu Arg Arg His Asn Lys
5 10 15

TCC GGC GAT CTC TGG ATC TCC ATC CAG GGC AAG GTC TAC GAC TGC TCT 152
Ser Gly Asp Leu Trp Ile Ser Ile Gln Gly Lys Val Tyr Asp Cys Ser
20 25 30 35

CGG TGG GCG GCG GAG CAC CCC GGC GGC GAG GTC CCG CTC CTC AGT CTG 200
Arg Trp Ala Ala Glu His Pro Gly Gly Glu Val Pro Leu Leu Ser Leu
40 45 50

GCC GGC CAG GAC GTC ACC GAC GCC TTC ATT GCG TAC CAC CCG GGC ACG 248
Ala Gly Gln Asp Val Thr Asp Ala Phe Ile Ala Tyr His Pro Gly Thr
55 60 65

GCG TGG CGG CAT CTG GAT CCG CTC TTC ACC GGC TAC TAC TAC CTC AAG 296
Ala Trp Arg His Leu Asp Pro Leu Phe Thr Gly Tyr Tyr Tyr Leu Lys
70 75 80

GAC TTC GAA GTG TCG GAG ATC TCC AAG GAC TAC CGG AGG CTT TTG AAC 344
Asp Phe Glu Val Ser Glu Ile Ser Lys Asp Tyr Arg Arg Leu Leu Asn
85 90 95

GAG ATG TCG CGG TCC GGG ATC TTC GAG AAG AAG GGC CAC CAC ATC ATG 392
Glu Met Ser Arg Ser Gly Ile Phe Glu Lys Lys Gly His His Ile Met
100 105 110 115

TGG ACG TTC GTC GGC GTT GCG GTC ATG ATG GCG GCA ATC GTC TAC GGC 440
Trp Thr Phe Val Gly Val Ala Val Met Met Ala Ala Ile Val Tyr Gly
120 125 130

GTG jjpTG GCG TCG GAG TCC GTC GGA GTT CAC ATG CTC TGC GGC GCA CTG 488
Val Leu Ala Ser Glu Ser Val Gly Val His Met Leu Cys Gly Ala Leu
135 140 145

CTG GGC TTG CTG TGG ATC CAA GCC GCG TAT GTG GGC CAT GAC TCC GGC 536
Leu Gly Leu Leu Trp Ile Gln Ala Ala Tyr Val Gly His Asp Ser Gly
150 155 160

CAT TAC CAG GTG ATG CCA ACC CGT GGA TAC AAC AGA ATC ACG CAA CTC 584
His Tyr Gln Val Met Pro Thr Arg Gly Tyr Asn Arg Ile Thr Gln Leu
165 170 175

ATA GCA GGC AAC ATC CTA ACC GGA ATC AGC ATC GCG TGG TGG AAG TGG 632
Ile Ala Gly Asn Ile Leu Thr Gly Ile Ser Ile Ala Trp Trp Lys Trp
180 185 190 195

ACC CAC AAC GCC CAC CAC CTC GCC TGC AAC AGC CTC GAC TAC GAC CCC 680
Thr His Asn Ala His His Leu Ala Cys Asn Ser Leu Asp Tyr Asp Pro
200 205 210

GAC CTC CAG CAC ATC CCC GTA TTC GCC GTC TCC ACC CGA CTC TTC AAC 728
Asp Leu Gln His Ile Pro Val Phe Ala Val Ser Thr Arg Leu Phe Asn
215 220 225

TCC ATC ACC TCG GTC TTC TAT GGC CGA GTC CTG AAA TTC GAC GAA GTG 776
Ser Ile Thr Ser Val Phe Tyr Gly Arg Val Leu Lys Phe Asp Glu Val
230 235 240

GCA CGG TTC CTA GTC AGC TAC GAG CAC TGG ACC TAC TAC CCG GTC ATG 824
Ala Arg Phe Leu Val Ser Tyr Gln His Trp Thr Tyr Tyr Pro Val Met
245 250 255

ATC TTC GGC CGA GTC AAC CTC TTC ATC CAG ACC TTT TTA TTG CTC CTC 872
Ile Phe Gly Arg Val Asn Leu Phe Ile Gln Thr Phe Leu Leu Leu Leu
260 265 270 275

ACC AGG CGC GAC GTC CCT GAC CGC GCT CTA AAC TTA ATG GGT ATC GCG 920
Thr Arg Arg Asp Val Pro Asp Arg Ala Leu Asn Leu Met Gly Ile Ala
280 285 290

GTT TTC TGG ACG TGG TTC CCG CTC TTC GTA TCT TGT CTC CCG AAC TGG 968
Val Phe Trp Thr Trp Phe Pro Leu Phe Val Ser Cys Leu Pro Asn Trp
295 300 305

CCT GAA CGG TTC GGG TTC GTC CTC ATC AGC TTT GCG GTC ACG GCG ATC 1016
Pro Glu Arg Phe Gly Phe Val Leu Ile Ser Phe Ala Val Thr Ala Ile
310 315 320

CAG CAC GTC CAG TTC ACG CTC AAC CAC TTC TCC GGC GAC ACA TAC GTG 1064
Gln His Val Gln Phe Thr Leu Asn His Phe Ser Gly Asp Thr Tyr Val
325 330 335

GGC CCC CCC AAG GGC GAC AAC TGG TTC GAG AAG CAG ACG AAA GGG ACG 1112
Gly Pro Pro Lys Gly Asp Asn Trp Phe Glu Lys Gln Thr Lys Gly Thr
340 345 350 355

ATC GAT ATC ACG TGC CCA CCG TGG ATG GAC TGG TTC TTT GGT GGG CTG 1160
Ile Asp Ile Thr Cys Pro Pro Trp Met Asp Trp Phe Phe Gly Gly Leu
360 365 370

CAG TTC CAG TTG GAG CAC CAC TTG TTC CCT AGG CTG CCG CGT GGG CAG 1208
Gln Phe Gln Leu Glu His His Leu Phe Pro Arg Leu Pro Arg Gly Gln
375 380 385

CTT AGG AAG ATT GCG CCC TTG GCT CGG GAC TTG TGT AAG AAG CAC GGG 1256
Leu Arg Lys Ile Ala Pro Leu Ala Arg Asp Leu Cys Lys Lys His Gly
390 395 400

ATG CCG TAT AGG AGC TTC GGG TTT TGG GAC GAC GCT AAT GTC AGG ACA 1304
Met Pro Tyr Arg Ser Phe Gly Phe Trp Asp Asp Ala Asn Val Arg Thr
405 410 415

ATT CGG ACG CTG AGG GAT GCG GCG GTT CAG GCG CGT GAC CTT AAT TCG 1352
Ile Arg Thr Leu Arg Asp Ala Ala Val Gln Ala Arg Asp Leu Asn Ser
420 425 430 435

GCC CCG TGC CCT AAG AAA CTT GGG TAT GGG GAA GCT TAT AAC ACC CAT 1400
Ala Pro Cys Pro Lys Lys Leu Gly Tyr Gly Glu Ala Tyr Asn Thr His
440 445 450

GGT TGA TTGTGGTTTT GTGTTGTGGG TTGGAGGATC TTCTTATTAT TGATTTATGT 1456
Gly *

CCACAATATT GAACTGAATA ACCATGGAAG GCACTACGTT CAGCTTAACT TTGCTTAACT 1516

TTGCTAGCTG GTTGCGTTCC CTTGTTGGGG GCAAAGTGCA GTATTTATTT TCTTATCCCA 1576

TGTACTTTTT GATTATTGTT CTTATTCGTA TCATAAATAA TTTATTATTG ATTAATTTTT 1636

GTTGTAGTTG GGTGTCTATA GCAAGTTTAT AATACTGAGA TATATTTTTT TGGTAAAAAA 1696

AAAAAA 1702

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 453 amino acids

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27:

Met Glu Gly Glu Ala Lys Lys Tyr Ile Thr Ala Glu Asp Leu Arg Arg
1 5 10 15

His Asn Lys Ser Gly Asp Leu Trp Ile Ser Ile Gln Gly Lys Val Tyr
20 25 30

Asp Cys Ser Arg Trp Ala Ala Glu His Pro Gly Gly Glu Val Pro Leu
35 40 45

Leu Ser Leu Ala Gly Gln Asp Val Thr Asp Ala Phe Ile Ala Tyr His
50 55 60

Pro Gly Thr Ala Trp Arg His Leu Asp Pro Leu Phe Thr Gly Tyr Tyr
65 70 75 80

Tyr Leu Lys Asp Phe Glu Val Ser Glu Ile Ser Lys Asp Tyr Arg Arg
85 90 95

Leu Leu Asn Glu Met Ser Arg Ser Gly Ile Phe Glu Lys Lys Gly His
100 105 110

His Ile Met Trp Thr Phe Val Gly Val Ala Val Met Met Ala Ala Ile
115 120 125

Val Tyr Gly Val Leu Ala Ser Glu Ser Val Gly Val His Met Leu Cys
130 135 140

Gly Ala Leu Leu Gly Leu Leu Trp Ile Gln Ala Ala Tyr Val Gly His
145 150 155 160

Asp Ser Gly His Tyr Gln Val Met Pro Thr Arg Gly Tyr Asn Arg Ile
165 170 175

Thr Gln Leu Ile Ala Gly Asn Ile Leu Thr Gly Ile Ser Ile Ala Trp
180 185 190

Trp Lys Trp Thr His Asn Ala His His Leu Ala Cys Asn Ser Leu Asp
195 200 205

Tyr Asp Pro Asp Leu Gln His Ile Pro Val Phe Ala Val Ser Thr Arg
210 215 220

Leu Phe Asn Ser Ile Thr Ser Val Phe Tyr Gly Arg Val Leu Lys Phe
225 230 235 240

Asp Glu Val Ala Arg Phe Leu Val Ser Tyr Gln His Trp Thr Tyr Tyr
245 250 255

Pro Val Met Ile Phe Gly Arg Val Asn Leu Phe Ile Gln Thr Phe Leu
260 265 270

Leu Leu Leu Thr Arg Arg Asp Val Pro Asp Arg Ala Leu Asn Leu Met
275 280 285

Gly Ile Ala Val Phe Trp Thr Trp Phe Pro Leu Phe Val Ser Cys Leu
290 295 300

Pro Asn Trp Pro Glu Arg Phe Gly Phe Val Leu Ile Ser Phe Ala Val
305 310 315 320

Thr Ala Ile Gln His Val Gln Phe Thr Leu Asn His Phe Ser Gly Asp
325 330 335

Thr Tyr Val Gly Pro Pro Lys Gly Asp Asn Trp Phe Glu Lys Gln Thr
340 345 350

Lys Gly Thr Ile Asp Ile Thr Cys Pro Pro Trp Met Asp Trp Phe Phe
355 360 365

Gly Gly Leu Gln Phe Gln Leu Glu His His Leu Phe Pro Arg Leu Pro
370 375 380

Arg Gly Gln Leu Arg Lys Ile Ala Pro Leu Ala Arg Asp Leu Cys Lys
385 390 395 400

Lys His Gly Met Pro Tyr Arg Ser Phe Gly Phe Trp Asp Asp Ala Asn
405 410 415

Val Arg Thr Ile Arg Thr Leu Arg Asp Ala Ala Val Gln Ala Arg Asp
420 425 430

Leu Asn Ser Ala Pro Cys Pro Lys Lys Leu Gly Tyr Gly Glu Ala Tyr
435 440 445

Asn Thr His Gly *
450



